Overview

Dataset statistics

Number of variables: 43
Number of observations: 114
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 38.4 KiB
Average record size in memory: 345.1 B
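
The two memory figures are consistent with each of the 43 columns holding 114 eight-byte values (the 912.0 B reported per variable) plus a small fixed index. A minimal sketch of that arithmetic follows; the 8 bytes per value and the 128-byte index are assumptions chosen to match the reported numbers, not figures the report states.

```python
N_VARIABLES = 43
N_OBSERVATIONS = 114
BYTES_PER_VALUE = 8   # assumption: every column stores 64-bit values
INDEX_BYTES = 128     # assumption: fixed overhead of the row index

# Memory taken by one column, then by the whole table.
column_bytes = N_OBSERVATIONS * BYTES_PER_VALUE
total_bytes = N_VARIABLES * column_bytes + INDEX_BYTES

total_kib = total_bytes / 1024
record_bytes = total_bytes / N_OBSERVATIONS

print(f"per column: {column_bytes} B")
print(f"total: {total_kib:.1f} KiB")
print(f"average record: {record_bytes:.1f} B")
```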

Variable types

BOOL: 25
NUM: 17
CAT: 1

Reproduction

Analysis started: 2020-05-14 08:37:29.047263
Analysis finished: 2020-05-14 08:38:43.009477
Duration: 1 minute and 13.96 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

numSub is highly correlated with numPal and 2 other fields [High correlation]
numPal is highly correlated with numSub and 3 other fields [High correlation]
numVrb is highly correlated with numPal and 2 other fields [High correlation]
numDet is highly correlated with numPal and 3 other fields [High correlation]
numAdv is highly correlated with numVrb [High correlation]
numAdp is highly correlated with numPal and 2 other fields [High correlation]
Mídia has 81 (71.1%) zeros [Zeros]
Links I. has 94 (82.5%) zeros [Zeros]
Links E. has 6 (5.3%) zeros [Zeros]
numPar has 5 (4.4%) zeros [Zeros]
numNum has 8 (7.0%) zeros [Zeros]
tamTitulo has 2 (1.8%) zeros [Zeros]

Variables

Mídia
Real number (ℝ≥0)

ZEROS

Distinct count: 6
Unique (%): 5.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.10380116959064328
Minimum: 0.0
Maximum: 0.9999999999999999
Zeros: 81
Zeros (%): 71.1%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0.1666666667
95-th percentile: 0.5583333333
Maximum: 1
Range: 1
Interquartile range (IQR): 0.1666666667

Descriptive statistics

Standard deviation: 0.2072794594
Coefficient of variation (CV): 1.99688944
Kurtosis: 6.130033713
Mean: 0.1038011696
Median Absolute Deviation (MAD): 0
Skewness: 2.43914056
Sum: 11.83333333
Variance: 0.04296477428

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 81 | 71.1%
0.1666666667 | 15 | 13.2%
0.3333333333 | 8 | 7.0%
0.6666666667 | 4 | 3.5%
0.5 | 4 | 3.5%
1 | 2 | 1.8%

Minimum 5 values

Value | Count | Frequency (%)
0 | 81 | 71.1%
0.1666666667 | 15 | 13.2%
0.3333333333 | 8 | 7.0%
0.5 | 4 | 3.5%
0.6666666667 | 4 | 3.5%

Maximum 5 values

Value | Count | Frequency (%)
1 | 2 | 1.8%
0.6666666667 | 4 | 3.5%
0.5 | 4 | 3.5%
0.3333333333 | 8 | 7.0%
0.1666666667 | 15 | 13.2%
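
Because Mídia takes only six distinct values, the whole column can be rebuilt from its frequency table, which makes it easy to see how the summary figures relate to one another. The sketch below uses only the Python standard library. It assumes the sample (n - 1) variance and linearly interpolated percentiles, both of which reproduce the values reported for this variable, and it treats the displayed fractions as exact multiples of 1/6.

```python
import math
import statistics

# Value -> count, read off the Mídia frequency table (assumed multiples of 1/6).
counts = {0: 81, 1 / 6: 15, 2 / 6: 8, 3 / 6: 4, 4 / 6: 4, 1: 2}
values = sorted(v for v, c in counts.items() for _ in range(c))


def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1]."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


n = len(values)
mean = sum(values) / n
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
zeros_share = counts[0] / n
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print(f"n={n} mean={mean:.10f} variance={variance:.10f} cv={cv:.8f}")
print(f"zeros={zeros_share:.1%} Q3={q3:.10f} P95={p95:.10f} MAD={mad}")
```

The MAD is 0 here because more than half of the observations equal the median (0), so the median of the absolute deviations is also 0.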
 

SEO
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
1 | 96 | 84.2%
0 | 18 | 15.8%

Links I.
Real number (ℝ≥0)

ZEROS

Distinct count: 7
Unique (%): 6.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.05639097744360902
Minimum: 0.0
Maximum: 1.0
Zeros: 94
Zeros (%): 82.5%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 0.2857142857
Maximum: 1
Range: 1
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.1608385344
Coefficient of variation (CV): 2.852203344
Kurtosis: 17.10719987
Mean: 0.05639097744
Median Absolute Deviation (MAD): 0
Skewness: 3.902916871
Sum: 6.428571429
Variance: 0.02586903415

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 94 | 82.5%
0.1428571429 | 9 | 7.9%
0.2857142857 | 6 | 5.3%
0.4285714286 | 2 | 1.8%
0.7142857143 | 1 | 0.9%
0.8571428571 | 1 | 0.9%
1 | 1 | 0.9%

Minimum 5 values

Value | Count | Frequency (%)
0 | 94 | 82.5%
0.1428571429 | 9 | 7.9%
0.2857142857 | 6 | 5.3%
0.4285714286 | 2 | 1.8%
0.7142857143 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8571428571 | 1 | 0.9%
0.7142857143 | 1 | 0.9%
0.4285714286 | 2 | 1.8%
0.2857142857 | 6 | 5.3%

Links E.
Real number (ℝ≥0)

ZEROS

Distinct count: 24
Unique (%): 21.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.13413078149920254
Minimum: 0.0
Maximum: 1.0
Zeros: 6
Zeros (%): 5.3%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.01181818182
Q1: 0.05454545455
median: 0.09090909091
Q3: 0.1454545455
95-th percentile: 0.4181818182
Maximum: 1
Range: 1
Interquartile range (IQR): 0.09090909091

Descriptive statistics

Standard deviation: 0.1576984756
Coefficient of variation (CV): 1.175706827
Kurtosis: 12.37094612
Mean: 0.1341307815
Median Absolute Deviation (MAD): 0.05454545455
Skewness: 3.193515266
Sum: 15.29090909
Variance: 0.02486880919

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.09090909091 | 13 | 11.4%
0.05454545455 | 12 | 10.5%
0.07272727273 | 12 | 10.5%
0.03636363636 | 12 | 10.5%
0.1090909091 | 8 | 7.0%
0.1454545455 | 8 | 7.0%
0.01818181818 | 8 | 7.0%
0.1272727273 | 8 | 7.0%
0 | 6 | 5.3%
0.1636363636 | 5 | 4.4%
Other values (14) | 22 | 19.3%

Minimum 5 values

Value | Count | Frequency (%)
0 | 6 | 5.3%
0.01818181818 | 8 | 7.0%
0.03636363636 | 12 | 10.5%
0.05454545455 | 12 | 10.5%
0.07272727273 | 12 | 10.5%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8363636364 | 1 | 0.9%
0.7454545455 | 1 | 0.9%
0.5636363636 | 1 | 0.9%
0.5090909091 | 1 | 0.9%

Complexidade
Categorical

Distinct count: 3
Unique (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0.5 | 63 | 55.3%
0 | 34 | 29.8%
1 | 17 | 14.9%

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Introdução
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
1 | 102 | 89.5%
0 | 12 | 10.5%

Analogias
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 98 | 86.0%
1 | 16 | 14.0%
 
Interação
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
1 | 86 | 75.4%
0 | 28 | 24.6%

Siglas
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
1 | 91 | 79.8%
0 | 23 | 20.2%

numPal
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 108
Unique (%): 94.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.32210594052699315
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.0740990991
Q1: 0.1449485199
median: 0.2715572716
Q3: 0.4475546976
95-th percentile: 0.7150900901
Maximum: 1
Range: 1
Interquartile range (IQR): 0.3026061776

Descriptive statistics

Standard deviation: 0.2162244715
Coefficient of variation (CV): 0.6712837124
Kurtosis: 0.3037093515
Mean: 0.3221059405
Median Absolute Deviation (MAD): 0.1447876448
Skewness: 0.8874607834
Sum: 36.72007722
Variance: 0.04675302209

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.09137709138 | 3 | 2.6%
0.3056628057 | 2 | 1.8%
0.4568854569 | 2 | 1.8%
0.1267696268 | 2 | 1.8%
0.3513513514 | 2 | 1.8%
0.2593307593 | 1 | 0.9%
0.5637065637 | 1 | 0.9%
0.6537966538 | 1 | 0.9%
0.5958815959 | 1 | 0.9%
0.0888030888 | 1 | 0.9%
Other values (98) | 98 | 86.0%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.02123552124 | 1 | 0.9%
0.04375804376 | 1 | 0.9%
0.05727155727 | 1 | 0.9%
0.06241956242 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.9646074646 | 1 | 0.9%
0.8037323037 | 1 | 0.9%
0.7966537967 | 1 | 0.9%
0.7323037323 | 1 | 0.9%

numPar
Real number (ℝ≥0)

ZEROS

Distinct count: 25
Unique (%): 21.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.18114035087719296
Minimum: 0.0
Maximum: 1.0
Zeros: 5
Zeros (%): 4.4%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.025
Q1: 0.05
median: 0.125
Q3: 0.275
95-th percentile: 0.4925
Maximum: 1
Range: 1
Interquartile range (IQR): 0.225

Descriptive statistics

Standard deviation: 0.1714578757
Coefficient of variation (CV): 0.9465471103
Kurtosis: 4.504634068
Mean: 0.1811403509
Median Absolute Deviation (MAD): 0.1
Skewness: 1.775819833
Sum: 20.65
Variance: 0.02939780314

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.025 | 16 | 14.0%
0.125 | 14 | 12.3%
0.05 | 11 | 9.6%
0.075 | 9 | 7.9%
0.1 | 9 | 7.9%
0.225 | 6 | 5.3%
0.275 | 5 | 4.4%
0.3 | 5 | 4.4%
0 | 5 | 4.4%
0.325 | 4 | 3.5%
Other values (15) | 30 | 26.3%

Minimum 5 values

Value | Count | Frequency (%)
0 | 5 | 4.4%
0.025 | 16 | 14.0%
0.05 | 11 | 9.6%
0.075 | 9 | 7.9%
0.1 | 9 | 7.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.725 | 1 | 0.9%
0.675 | 1 | 0.9%
0.55 | 2 | 1.8%
0.525 | 1 | 0.9%

numSub
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 94
Unique (%): 82.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.28988010566957934
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.05144787645
Q1: 0.125
median: 0.2480694981
Q3: 0.4054054054
95-th percentile: 0.6835907336
Maximum: 1
Range: 1
Interquartile range (IQR): 0.2804054054

Descriptive statistics

Standard deviation: 0.2088849697
Coefficient of variation (CV): 0.7205909119
Kurtosis: 0.804574093
Mean: 0.2898801057
Median Absolute Deviation (MAD): 0.1332046332
Skewness: 1.034063949
Sum: 33.04633205
Variance: 0.04363293056

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.08301158301 | 3 | 2.6%
0.5501930502 | 3 | 2.6%
0.1911196911 | 3 | 2.6%
0.05791505792 | 3 | 2.6%
0.2104247104 | 3 | 2.6%
0.4092664093 | 3 | 2.6%
0.2200772201 | 2 | 1.8%
0.2876447876 | 2 | 1.8%
0.09073359073 | 2 | 1.8%
0.05598455598 | 2 | 1.8%
Other values (84) | 88 | 77.2%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.02123552124 | 1 | 0.9%
0.02316602317 | 1 | 0.9%
0.04054054054 | 1 | 0.9%
0.04826254826 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8822393822 | 1 | 0.9%
0.8127413127 | 1 | 0.9%
0.7895752896 | 1 | 0.9%
0.7799227799 | 1 | 0.9%

numAdj
Real number (ℝ≥0)

Distinct count: 57
Unique (%): 50.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.33293820135925395
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.06891891892
Q1: 0.1734234234
median: 0.2792792793
Q3: 0.4684684685
95-th percentile: 0.8108108108
Maximum: 1
Range: 1
Interquartile range (IQR): 0.295045045

Descriptive statistics

Standard deviation: 0.2163171465
Coefficient of variation (CV): 0.649721617
Kurtosis: 0.6579722893
Mean: 0.3329382014
Median Absolute Deviation (MAD): 0.1576576577
Skewness: 0.9354386619
Sum: 37.95495495
Variance: 0.04679310789

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.1081081081 | 7 | 6.1%
0.2162162162 | 5 | 4.4%
0.1981981982 | 5 | 4.4%
0.2342342342 | 5 | 4.4%
0.4234234234 | 5 | 4.4%
0.5135135135 | 4 | 3.5%
0.4774774775 | 4 | 3.5%
0.8108108108 | 3 | 2.6%
0.4594594595 | 3 | 2.6%
0.0990990991 | 3 | 2.6%
Other values (47) | 70 | 61.4%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.04504504505 | 2 | 1.8%
0.05405405405 | 2 | 1.8%
0.06306306306 | 1 | 0.9%
0.07207207207 | 2 | 1.8%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.972972973 | 1 | 0.9%
0.9189189189 | 1 | 0.9%
0.8468468468 | 1 | 0.9%
0.8108108108 | 3 | 2.6%

numVrb
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 83
Unique (%): 72.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.27846708024814887
Minimum: 0.0
Maximum: 0.9999999999999999
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.05190114068
Q1: 0.1368821293
median: 0.2414448669
Q3: 0.3935361217
95-th percentile: 0.6199619772
Maximum: 1
Range: 1
Interquartile range (IQR): 0.2566539924

Descriptive statistics

Standard deviation: 0.1888550136
Coefficient of variation (CV): 0.6781951155
Kurtosis: 1.059146551
Mean: 0.2784670802
Median Absolute Deviation (MAD): 0.1197718631
Skewness: 1.035328565
Sum: 31.74524715
Variance: 0.03566621618

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.1977186312 | 4 | 3.5%
0.5475285171 | 4 | 3.5%
0.1520912548 | 4 | 3.5%
0.1558935361 | 3 | 2.6%
0.3954372624 | 3 | 2.6%
0.2661596958 | 3 | 2.6%
0.2053231939 | 2 | 1.8%
0.5323193916 | 2 | 1.8%
0.06463878327 | 2 | 1.8%
0.2357414449 | 2 | 1.8%
Other values (73) | 85 | 74.6%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.02661596958 | 1 | 0.9%
0.03041825095 | 2 | 1.8%
0.04182509506 | 1 | 0.9%
0.04942965779 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.7794676806 | 1 | 0.9%
0.6882129278 | 1 | 0.9%
0.6844106464 | 1 | 0.9%
0.6615969582 | 1 | 0.9%

numNEs
Real number (ℝ≥0)

Distinct count: 70
Unique (%): 61.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.16950270412874294
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.01503759398
Q1: 0.04229323308
median: 0.09962406015
Q3: 0.2359022556
95-th percentile: 0.4642857143
Maximum: 1
Range: 1
Interquartile range (IQR): 0.1936090226

Descriptive statistics

Standard deviation: 0.1786284575
Coefficient of variation (CV): 1.053838394
Kurtosis: 5.248303806
Mean: 0.1695027041
Median Absolute Deviation (MAD): 0.06954887218
Skewness: 2.012569004
Sum: 19.32330827
Variance: 0.03190812584

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.03759398496 | 7 | 6.1%
0.03007518797 | 5 | 4.4%
0.01879699248 | 4 | 3.5%
0.2180451128 | 3 | 2.6%
0.0977443609 | 3 | 2.6%
0.06015037594 | 3 | 2.6%
0.01503759398 | 3 | 2.6%
0.09022556391 | 3 | 2.6%
0.06766917293 | 3 | 2.6%
0.04887218045 | 3 | 2.6%
Other values (60) | 77 | 67.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.003759398496 | 1 | 0.9%
0.007518796992 | 1 | 0.9%
0.01127819549 | 2 | 1.8%
0.01503759398 | 3 | 2.6%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8458646617 | 1 | 0.9%
0.7631578947 | 1 | 0.9%
0.537593985 | 1 | 0.9%
0.492481203 | 1 | 0.9%

numDet
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 71
Unique (%): 62.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.37809011164274325
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.07755681818
Q1: 0.1732954545
median: 0.3267045455
Q3: 0.5170454545
95-th percentile: 0.8599431818
Maximum: 1
Range: 1
Interquartile range (IQR): 0.34375

Descriptive statistics

Standard deviation: 0.2492149924
Coefficient of variation (CV): 0.6591417885
Kurtosis: -0.4038583686
Mean: 0.3780901116
Median Absolute Deviation (MAD): 0.1789772727
Skewness: 0.7391354662
Sum: 43.10227273
Variance: 0.06210811245

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.1477272727 | 5 | 4.4%
0.3011363636 | 5 | 4.4%
0.1136363636 | 4 | 3.5%
0.8636363636 | 4 | 3.5%
0.2784090909 | 4 | 3.5%
0.3295454545 | 3 | 2.6%
0.3636363636 | 3 | 2.6%
0.2329545455 | 3 | 2.6%
0.8522727273 | 3 | 2.6%
0.06818181818 | 3 | 2.6%
Other values (61) | 77 | 67.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.05681818182 | 1 | 0.9%
0.06818181818 | 3 | 2.6%
0.07386363636 | 1 | 0.9%
0.07954545455 | 2 | 1.8%

Maximum 5 values

Value | Count | Frequency (%)
1 | 2 | 1.8%
0.8636363636 | 4 | 3.5%
0.8579545455 | 1 | 0.9%
0.8522727273 | 3 | 2.6%
0.8238636364 | 1 | 0.9%

numConj
Real number (ℝ≥0)

Distinct count: 43
Unique (%): 37.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.2842348927875244
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.05555555556
Q1: 0.1423611111
median: 0.2222222222
Q3: 0.3854166667
95-th percentile: 0.6770833333
Maximum: 1
Range: 1
Interquartile range (IQR): 0.2430555556

Descriptive statistics

Standard deviation: 0.1987356571
Coefficient of variation (CV): 0.6991951452
Kurtosis: 1.70063152
Mean: 0.2842348928
Median Absolute Deviation (MAD): 0.09722222222
Skewness: 1.306531005
Sum: 32.40277778
Variance: 0.03949586141

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.1527777778 | 7 | 6.1%
0.1388888889 | 7 | 6.1%
0.2777777778 | 6 | 5.3%
0.4166666667 | 6 | 5.3%
0.2638888889 | 5 | 4.4%
0.1111111111 | 5 | 4.4%
0.1805555556 | 5 | 4.4%
0.2083333333 | 5 | 4.4%
0.06944444444 | 5 | 4.4%
0.04166666667 | 4 | 3.5%
Other values (33) | 59 | 51.8%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.04166666667 | 4 | 3.5%
0.05555555556 | 2 | 1.8%
0.06944444444 | 5 | 4.4%
0.09722222222 | 3 | 2.6%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.9027777778 | 1 | 0.9%
0.8611111111 | 1 | 0.9%
0.7638888889 | 1 | 0.9%
0.7361111111 | 1 | 0.9%

numAdv
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 59
Unique (%): 51.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.2935334203276787
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.05495867769
Q1: 0.132231405
median: 0.2561983471
Q3: 0.3801652893
95-th percentile: 0.6623966942
Maximum: 1
Range: 1
Interquartile range (IQR): 0.2479338843

Descriptive statistics

Standard deviation: 0.201803883
Coefficient of variation (CV): 0.6874988297
Kurtosis: 0.9507502765
Mean: 0.2935334203
Median Absolute Deviation (MAD): 0.1239669421
Skewness: 1.040253032
Sum: 33.46280992
Variance: 0.04072480718

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.09917355372 | 5 | 4.4%
0.1652892562 | 5 | 4.4%
0.2479338843 | 5 | 4.4%
0.1239669421 | 4 | 3.5%
0.3553719008 | 4 | 3.5%
0.5867768595 | 3 | 2.6%
0.2727272727 | 3 | 2.6%
0.3801652893 | 3 | 2.6%
0.2975206612 | 3 | 2.6%
0.3140495868 | 3 | 2.6%
Other values (49) | 76 | 66.7%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.00826446281 | 1 | 0.9%
0.03305785124 | 2 | 1.8%
0.04132231405 | 1 | 0.9%
0.04958677686 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8347107438 | 1 | 0.9%
0.7933884298 | 3 | 2.6%
0.7107438017 | 1 | 0.9%
0.6363636364 | 1 | 0.9%

numAdp
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 80
Unique (%): 70.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.33509490940465914
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.07786885246
Q1: 0.1680327869
median: 0.2868852459
Q3: 0.4518442623
95-th percentile: 0.7616803279
Maximum: 1
Range: 1
Interquartile range (IQR): 0.2838114754

Descriptive statistics

Standard deviation: 0.2208719883
Coefficient of variation (CV): 0.6591326281
Kurtosis: 0.5134896061
Mean: 0.3350949094
Median Absolute Deviation (MAD): 0.1516393443
Skewness: 0.9699515946
Sum: 38.20081967
Variance: 0.04878443522

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.25 | 4 | 3.5%
0.1803278689 | 4 | 3.5%
0.09836065574 | 4 | 3.5%
0.07786885246 | 3 | 2.6%
0.1885245902 | 3 | 2.6%
0.06967213115 | 3 | 2.6%
0.1147540984 | 3 | 2.6%
0.1352459016 | 3 | 2.6%
0.2868852459 | 3 | 2.6%
0.3032786885 | 2 | 1.8%
Other values (70) | 82 | 71.9%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.04098360656 | 1 | 0.9%
0.06967213115 | 3 | 2.6%
0.07786885246 | 3 | 2.6%
0.08196721311 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.987704918 | 1 | 0.9%
0.9467213115 | 1 | 0.9%
0.8073770492 | 1 | 0.9%
0.8032786885 | 1 | 0.9%

numNum
Real number (ℝ≥0)

ZEROS

Distinct count: 33
Unique (%): 28.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1650375939849624
Minimum: 0.0
Maximum: 1.0
Zeros: 8
Zeros (%): 7.0%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0.04285714286
median: 0.1142857143
Q3: 0.1857142857
95-th percentile: 0.5485714286
Maximum: 1
Range: 1
Interquartile range (IQR): 0.1428571429

Descriptive statistics

Standard deviation: 0.1870861719
Coefficient of variation (CV): 1.133597306
Kurtosis: 4.985246714
Mean: 0.165037594
Median Absolute Deviation (MAD): 0.07142857143
Skewness: 2.140099055
Sum: 18.81428571
Variance: 0.03500123571

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.02857142857 | 10 | 8.8%
0.1 | 10 | 8.8%
0.01428571429 | 9 | 7.9%
0 | 8 | 7.0%
0.1428571429 | 8 | 7.0%
0.1285714286 | 7 | 6.1%
0.1142857143 | 6 | 5.3%
0.07142857143 | 5 | 4.4%
0.1714285714 | 5 | 4.4%
0.04285714286 | 5 | 4.4%
Other values (23) | 41 | 36.0%

Minimum 5 values

Value | Count | Frequency (%)
0 | 8 | 7.0%
0.01428571429 | 9 | 7.9%
0.02857142857 | 10 | 8.8%
0.04285714286 | 5 | 4.4%
0.05714285714 | 4 | 3.5%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8142857143 | 1 | 0.9%
0.7428571429 | 1 | 0.9%
0.7285714286 | 1 | 0.9%
0.6714285714 | 1 | 0.9%

Pergunta
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 70 | 61.4%
1 | 44 | 38.6%

tamParagraf
Real number (ℝ≥0)

Distinct count: 105
Unique (%): 92.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.403849981517814
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.07277147488
Q1: 0.2313614263
median: 0.4108589951
Q3: 0.5421393841
95-th percentile: 0.7739059968
Maximum: 1
Range: 1
Interquartile range (IQR): 0.3107779579

Descriptive statistics

Standard deviation: 0.2162254215
Coefficient of variation (CV): 0.5354102548
Kurtosis: -0.4555053152
Mean: 0.4038499815
Median Absolute Deviation (MAD): 0.1653160454
Skewness: 0.2388369147
Sum: 46.03889789
Variance: 0.04675343291

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.4683954619 | 4 | 3.5%
0.2771474878 | 2 | 1.8%
0.4878444084 | 2 | 1.8%
0.1118314425 | 2 | 1.8%
0.3030794165 | 2 | 1.8%
0.1993517018 | 2 | 1.8%
0.4943273906 | 2 | 1.8%
0.6337115073 | 1 | 0.9%
0.1021069692 | 1 | 0.9%
0.5623987034 | 1 | 0.9%
Other values (95) | 95 | 83.3%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.01458670989 | 1 | 0.9%
0.01944894652 | 1 | 0.9%
0.02106969206 | 1 | 0.9%
0.05348460292 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8395461912 | 1 | 0.9%
0.8330632091 | 1 | 0.9%
0.8314424635 | 1 | 0.9%
0.811993517 | 1 | 0.9%

tamTitulo
Real number (ℝ≥0)

ZEROS

Distinct count: 53
Unique (%): 46.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.40622389306599826
Minimum: 0.0
Maximum: 0.9999999999999999
Zeros: 2
Zeros (%): 1.8%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.1464285714
Q1: 0.2380952381
median: 0.369047619
Q3: 0.5357142857
95-th percentile: 0.7619047619
Maximum: 1
Range: 1
Interquartile range (IQR): 0.2976190476

Descriptive statistics

Standard deviation: 0.2038052577
Coefficient of variation (CV): 0.5017067218
Kurtosis: -0.2591127756
Mean: 0.4062238931
Median Absolute Deviation (MAD): 0.1547619048
Skewness: 0.4549845528
Sum: 46.30952381
Variance: 0.04153658306

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.3571428571 | 6 | 5.3%
0.3452380952 | 5 | 4.4%
0.4523809524 | 5 | 4.4%
0.1785714286 | 5 | 4.4%
0.5357142857 | 5 | 4.4%
0.2142857143 | 4 | 3.5%
0.4166666667 | 4 | 3.5%
0.3928571429 | 4 | 3.5%
0.2380952381 | 4 | 3.5%
0.2857142857 | 3 | 2.6%
Other values (43) | 69 | 60.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 2 | 1.8%
0.04761904762 | 1 | 0.9%
0.08333333333 | 1 | 0.9%
0.09523809524 | 1 | 0.9%
0.130952381 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.8452380952 | 2 | 1.8%
0.7976190476 | 2 | 1.8%
0.7619047619 | 2 | 1.8%
0.75 | 1 | 0.9%

Dias
Real number (ℝ≥0)

Distinct count: 113
Unique (%): 99.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.550298916080009
Minimum: 0.0
Maximum: 1.0
Zeros: 1
Zeros (%): 0.9%
Memory size: 912.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.06297770701
Q1: 0.2925955414
median: 0.5684713376
Q3: 0.8296178344
95-th percentile: 0.9300159236
Maximum: 1
Range: 1
Interquartile range (IQR): 0.537022293

Descriptive statistics

Standard deviation: 0.3007203354
Coefficient of variation (CV): 0.5464672502
Kurtosis: -1.294645202
Mean: 0.5502989161
Median Absolute Deviation (MAD): 0.2675159236
Skewness: -0.2374462067
Sum: 62.73407643
Variance: 0.09043272014

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.6687898089 | 2 | 1.8%
0.4458598726 | 1 | 0.9%
0.7133757962 | 1 | 0.9%
0.2340764331 | 1 | 0.9%
0.04458598726 | 1 | 0.9%
0.2675159236 | 1 | 0.9%
0.7579617834 | 1 | 0.9%
0.1114649682 | 1 | 0.9%
0.7866242038 | 1 | 0.9%
0.1449044586 | 1 | 0.9%
Other values (103) | 103 | 90.4%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.9%
0.01114649682 | 1 | 0.9%
0.02229299363 | 1 | 0.9%
0.03343949045 | 1 | 0.9%
0.04458598726 | 1 | 0.9%

Maximum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
0.9872611465 | 1 | 0.9%
0.9856687898 | 1 | 0.9%
0.9808917197 | 1 | 0.9%
0.9363057325 | 1 | 0.9%
 
categoria-ABC da ciência
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 110 | 96.5%
1 | 4 | 3.5%
 
categoria-Ciência Pop
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 109 | 95.6%
1 | 5 | 4.4%
 
categoria-Ciência ao redor
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 73 | 64.0%
1 | 41 | 36.0%
 
categoria-O que que a ciência tem?
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 78 | 68.4%
1 | 36 | 31.6%
 
categoria-Outros
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 108 | 94.7%
1 | 6 | 5.3%
 
categoria-Profissão Cientista
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 101 | 88.6%
1 | 13 | 11.4%
 
categoria-Sci… what?
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 107 | 93.9%
1 | 7 | 6.1%
 
categoria-Você disse ciência?
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 112 | 98.2%
1 | 2 | 1.8%
 
área-Astronomia
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 107 | 93.9%
1 | 7 | 6.1%
 
área-Atualidades
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 112 | 98.2%
1 | 2 | 1.8%
 
área-Biologia
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 96 | 84.2%
1 | 18 | 15.8%
 
área-Ciência
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 94 | 82.5%
1 | 20 | 17.5%
 
área-Física
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 99 | 86.8%
1 | 15 | 13.2%
 
área-História
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 103 | 90.4%
1 | 11 | 9.6%
 
área-Matemática
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 109 | 95.6%
1 | 5 | 4.4%
 
área-Medicina
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 104 | 91.2%
1 | 10 | 8.8%
 
área-Psicologia
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 110 | 96.5%
1 | 4 | 3.5%
 
área-Química
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 94 | 82.5%
1 | 20 | 17.5%
 
área-Tecnologia
Boolean

Distinct count: 2
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 912.0 B

Value | Count | Frequency (%)
0 | 112 | 98.2%
1 | 2 | 1.8%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that, for a linear function, the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
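
The definition above can be sketched in a few lines of plain Python. The helper name `pearson_r` is hypothetical, and the five (numPal, numSub) pairs are simply the first rows of the sample at the end of this report, used for illustration only; the report's own coefficients are computed on all 114 observations.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


# First five sample rows of numPal and numSub (illustration only).
num_pal = [0.091377, 0.211712, 0.456885, 0.076577, 0.227799]
num_sub = [0.123552, 0.289575, 0.480695, 0.090734, 0.297297]

r = pearson_r(num_pal, num_sub)
# Shifting and rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(num_pal, [3.0 * v + 10.0 for v in num_sub])
print(r, r_rescaled)
```

The (n - 1) factors cancel between numerator and denominator, so the same result is obtained with population (n) estimates.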

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
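
As a rough illustration of that recipe, the sketch below ranks each variable (tied values share the average of their ranks, which is an assumption about tie handling) and then applies the covariance-over-standard-deviations formula to the ranks. The function names and the toy data are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for i in order[start:end + 1]:
            ranks[i] = shared
        start = end + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but clearly nonlinear

print(spearman_rho(x, y), pearson_r(x, y))
```

On this toy data the ranks of x and y coincide, so ρ reaches its maximum while Pearson's r stays below 1 because the relationship is not linear.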

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
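
The pair-counting rule can be written directly as a small function. This is a sketch of the plain formula as stated in the text (often called tau-a); tied pairs count as neither concordant nor discordant, and tie-corrected variants used by statistical libraries can give different values when ties are present. The function name and the toy data are hypothetical.

```python
from itertools import combinations


def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        product = (x[i] - x[j]) * (y[i] - y[j])
        if product > 0:
            concordant += 1   # both variables move in the same direction
        elif product < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

print(kendall_tau(x, y))
```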

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

Sample

First rows

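Index | Mídia | SEO | Links I. | Links E. | Complexidade | Introdução | Analogias | Interação | Siglas | numPal | numPar | numSub | numAdj | numVrb | numNEs | numDet | numConj | numAdv | numAdp | numNum | Pergunta | tamParagraf | tamTitulo | Dias | categoria-ABC da ciência | categoria-Ciência Pop | categoria-Ciência ao redor | categoria-O que que a ciência tem? | categoria-Outros | categoria-Profissão Cientista | categoria-Sci… what? | categoria-Você disse ciência? | área-Astronomia | área-Atualidades | área-Biologia | área-Ciência | área-Física | área-História | área-Matemática | área-Medicina | área-Psicologia | área-Química | área-Tecnologia
0 | 0.000000 | 0.0 | 0.0 | 0.036364 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.091377 | 0.075 | 0.123552 | 0.144144 | 0.064639 | 0.165414 | 0.079545 | 0.097222 | 0.074380 | 0.098361 | 0.057143 | 1.0 | 0.222042 | 0.392857 | 1.000000 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
1 | 0.166667 | 1.0 | 0.0 | 0.054545 | 0.0 | 1.0 | 0.0 | 1.0 | 0.0 | 0.211712 | 0.300 | 0.289575 | 0.207207 | 0.136882 | 0.285714 | 0.232955 | 0.263889 | 0.057851 | 0.241803 | 0.114286 | 0.0 | 0.053485 | 0.178571 | 0.987261 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
2 | 1.000000 | 1.0 | 0.0 | 0.509091 | 0.5 | 1.0 | 1.0 | 1.0 | 0.0 | 0.456885 | 0.675 | 0.480695 | 0.252252 | 0.486692 | 0.398496 | 0.528409 | 0.347222 | 0.388430 | 0.438525 | 0.228571 | 1.0 | 0.014587 | 0.690476 | 0.985669 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0
3 | 0.000000 | 1.0 | 0.0 | 0.054545 | 0.5 | 1.0 | 1.0 | 1.0 | 0.0 | 0.076577 | 0.000 | 0.090734 | 0.054054 | 0.072243 | 0.063910 | 0.068182 | 0.138889 | 0.033058 | 0.135246 | 0.057143 | 0.0 | 0.555916 | 0.154762 | 0.980892 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
4 | 0.000000 | 1.0 | 0.0 | 0.000000 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.227799 | 0.175 | 0.297297 | 0.234234 | 0.117871 | 0.312030 | 0.250000 | 0.194444 | 0.115702 | 0.336066 | 0.200000 | 0.0 | 0.273906 | 0.583333 | 0.936306 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
5 | 0.000000 | 1.0 | 0.0 | 0.145455 | 0.5 | 1.0 | 1.0 | 1.0 | 0.0 | 0.126770 | 0.075 | 0.083012 | 0.099099 | 0.110266 | 0.090226 | 0.147727 | 0.041667 | 0.123967 | 0.135246 | 0.242857 | 0.0 | 0.260940 | 0.297619 | 0.933121 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
6 | 0.000000 | 1.0 | 0.0 | 0.054545 | 1.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.126770 | 0.025 | 0.133205 | 0.108108 | 0.121673 | 0.067669 | 0.147727 | 0.152778 | 0.074380 | 0.163934 | 0.000000 | 1.0 | 0.487844 | 0.345238 | 0.928344 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
7 | 0.000000 | 1.0 | 0.0 | 0.127273 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.091377 | 0.025 | 0.057915 | 0.000000 | 0.114068 | 0.030075 | 0.147727 | 0.111111 | 0.082645 | 0.127049 | 0.100000 | 1.0 | 0.345219 | 0.535714 | 0.925159 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
8 | 0.000000 | 0.0 | 0.0 | 0.000000 | 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.240669 | 0.075 | 0.216216 | 0.099099 | 0.235741 | 0.037594 | 0.386364 | 0.152778 | 0.239669 | 0.286885 | 0.000000 | 1.0 | 0.497569 | 0.285714 | 0.921975 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
9 | 0.166667 | 1.0 | 0.0 | 0.018182 | 0.5 | 1.0 | 0.0 | 1.0 | 0.0 | 0.595882 | 0.350 | 0.596525 | 0.504505 | 0.441065 | 0.451128 | 0.704545 | 0.250000 | 0.545455 | 0.651639 | 0.100000 | 0.0 | 0.335494 | 0.761905 | 0.920382 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0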

Last rows

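Index | Mídia | SEO | Links I. | Links E. | Complexidade | Introdução | Analogias | Interação | Siglas | numPal | numPar | numSub | numAdj | numVrb | numNEs | numDet | numConj | numAdv | numAdp | numNum | Pergunta | tamParagraf | tamTitulo | Dias | categoria-ABC da ciência | categoria-Ciência Pop | categoria-Ciência ao redor | categoria-O que que a ciência tem? | categoria-Outros | categoria-Profissão Cientista | categoria-Sci… what? | categoria-Você disse ciência? | área-Astronomia | área-Atualidades | área-Biologia | área-Ciência | área-Física | área-História | área-Matemática | área-Medicina | área-Psicologia | área-Química | área-Tecnologia
104 | 0.000000 | 0.0 | 0.000000 | 0.036364 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 0.232304 | 0.150 | 0.193050 | 0.180180 | 0.197719 | 0.041353 | 0.357955 | 0.069444 | 0.272727 | 0.250000 | 0.142857 | 1.0 | 0.243112 | 0.571429 | 0.089172 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
105 | 0.333333 | 1.0 | 0.714286 | 0.745455 | 0.5 | 1.0 | 1.0 | 1.0 | 0.0 | 0.653797 | 0.525 | 0.598456 | 0.639640 | 0.547529 | 0.229323 | 0.636364 | 0.458333 | 0.495868 | 0.651639 | 0.728571 | 0.0 | 0.197731 | 0.273810 | 0.087580 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
106 | 0.000000 | 1.0 | 0.000000 | 0.236364 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 0.150579 | 0.100 | 0.065637 | 0.198198 | 0.212928 | 0.026316 | 0.159091 | 0.222222 | 0.247934 | 0.106557 | 0.014286 | 1.0 | 0.277147 | 0.321429 | 0.078025 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0
107 | 0.000000 | 1.0 | 0.285714 | 0.236364 | 0.0 | 1.0 | 0.0 | 1.0 | 0.0 | 0.519949 | 0.125 | 0.333977 | 0.657658 | 0.547529 | 0.176692 | 0.511364 | 0.625000 | 0.471074 | 0.434426 | 0.671429 | 0.0 | 0.839546 | 0.428571 | 0.066879 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
108 | 0.333333 | 1.0 | 0.000000 | 0.054545 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 0.504505 | 0.275 | 0.409266 | 0.612613 | 0.448669 | 0.109023 | 0.681818 | 0.569444 | 0.338843 | 0.442623 | 0.171429 | 0.0 | 0.384117 | 0.166667 | 0.055732 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
109 | 0.166667 | 1.0 | 0.000000 | 0.181818 | 0.5 | 0.0 | 0.0 | 0.0 | 1.0 | 0.231660 | 0.125 | 0.210425 | 0.477477 | 0.152091 | 0.097744 | 0.346591 | 0.388889 | 0.148760 | 0.180328 | 0.057143 | 0.0 | 0.397083 | 0.226190 | 0.044586 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0
110 | 0.000000 | 1.0 | 0.000000 | 0.145455 | 0.5 | 1.0 | 1.0 | 1.0 | 1.0 | 0.376448 | 0.550 | 0.220077 | 0.522523 | 0.376426 | 0.041353 | 0.437500 | 0.305556 | 0.380165 | 0.221311 | 0.814286 | 1.0 | 0.000000 | 0.357143 | 0.033439 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0
111 | 0.000000 | 1.0 | 0.000000 | 0.109091 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 0.288932 | 0.075 | 0.235521 | 0.423423 | 0.304183 | 0.015038 | 0.301136 | 0.152778 | 0.314050 | 0.278689 | 0.000000 | 0.0 | 0.719611 | 0.178571 | 0.022293 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
112 | 0.166667 | 1.0 | 0.142857 | 0.090909 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 0.260618 | 0.100 | 0.210425 | 0.189189 | 0.243346 | 0.124060 | 0.414773 | 0.208333 | 0.247934 | 0.262295 | 0.185714 | 1.0 | 0.468395 | 0.523810 | 0.011146 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0
113 | 0.000000 | 1.0 | 0.000000 | 0.072727 | 0.5 | 1.0 | 0.0 | 1.0 | 1.0 | 0.357786 | 0.375 | 0.249035 | 0.288288 | 0.304183 | 0.120301 | 0.426136 | 0.208333 | 0.289256 | 0.307377 | 1.000000 | 0.0 | 0.102107 | 0.619048 | 0.000000 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0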